Contrast Subgraph Mining from Coherent Cores

نویسندگان

  • Jingbo Shang
  • Xiyao Shi
  • Meng Jiang
  • Liyuan Liu
  • Timothy Hanratty
  • Jiawei Han
چکیده

Graph paŠern mining methods can extract informative and useful paŠerns from large-scale graphs and capture underlying principles through the overwhelmed information. Contrast analysis serves as a keystone in various €elds and has demonstrated its e‚ectiveness in mining valuable information. However, it has been long overlooked in graph paŠern mining. Œerefore, in this paper, we introduce the concept of contrast subgraph, that is, a subset of nodes that have signi€cantly di‚erent edges or edge weights in two given graphs of the same node set. Œemajor challenge comes from the gap between the contrast and the informativeness. Because of the widely existing noise edges in real-world graphs, the contrast may lead to subgraphs of pure noise. To avoid such meaningless subgraphs, we leverage the similarity as the cornerstone of the contrast. Speci€cally, we €rst identify a coherent core, which is a small subset of nodes with similar edge structures in the two graphs, and then induce contrast subgraphs from the coherent cores. Moreover, we design a general family of coherence and contrast metrics and derive a polynomial-time algorithm to eciently extract contrast subgraphs. Extensive experiments verify the necessity of introducing coherent cores as well as the e‚ectiveness and eciency of our algorithm. Real-world applications demonstrate the tremendous potentials of contrast subgraph mining. ACM Reference format: Jingbo Shang1, Xiyao Shi1, Meng Jiang2, Liyuan Liu1, Timothy HanraŠy3, Jiawei Han1. 2018. Contrast Subgraph Mining from Coherent Cores. In Proceedings of ACM SIGKDD Conference on Knowledge Discovery and Data Mining, London, UK, August 2018 (KDD’18), 9 pages. DOI: 10.475/123 4

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Accurate Classification of Protein Structural Families Using Coherent Subgraph Analysis

Protein structural annotation and classification is an important problem in bioinformatics. We report on the development of an efficient subgraph mining technique and its application to finding characteristic substructural patterns within protein structural families. In our method, protein structures are represented by graphs where the nodes are residues and the edges connect residues found wit...

متن کامل

Parallel Subgraph Mining on Hybrid Platforms: Hpc Systems, Multi-cores and Gpus

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . x

متن کامل

Mining Density Contrast Subgraphs

Dense subgraph discovery is a key primitive in many graph mining applications, such as detecting communities in social networks and mining gene correlation from biological data. Most studies on dense subgraph mining only deal with one graph. However, in many applications, we have more than one graph describing relations among a same group of entities. In this paper, given two graphs sharing the...

متن کامل

A Scalable Parallel Approach for Subgraph Census Computation

Counting the occurrences of small subgraphs in large networks is a fundamental graph mining metric with several possible applications. Computing frequencies of those subgraphs is also known as the subgraph census problem, which is a computationally hard task. In this paper we provide a parallel multicore algorithm for this purpose. At its core we use FaSE, an efficient network-centric sequentia...

متن کامل

Cohesive Subgraph Mining on Attributed Graph

Finding cohesive subgraphs is a fundamental graph problem with a wide spectrum of applications. In this paper, we investigate this problem in the context of attributed graph, where each vertex is associated with content (e.g., geo-locations, tags and keywords). To properly capture the cohesiveness of the vertices in a subgraph from both graph structure and vertices attribute perspectives, we ad...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1802.06189  شماره 

صفحات  -

تاریخ انتشار 2018